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Implementation of a transform and of a subsequent 
quantization 

FIELD OF THE INVENTION 

The invention relates to the processing of digital data. It 
relates more specifically to a method for implementing an 
approximation of a discrete cosine transform (DCT) and a 
quantization, which transform and which quantization are to 
be applied subsequently to digital data, in particular 
digital image data, for compression of said digital data. It 
equally relates to a method for implementing a 
dequantization and an approximation of an inverse discrete 
cosine transform (IDCT) , wherein for decompression of 
digital data said quantization is to be applied in sequence 
with said inverse transform to compressed digital data. 
Finally, the invention relates to an encoder and to a 
decoder suited to carry out such a compression and such a 
decompression respectively. 

BACKGROUND OF THE INVENTION 

It is known from the state of the art to use a sequence of 
DCT and quantization for compressing digital data, for 
instance in order to enable an efficient transmission of 
this digital data. In particular a compression of digital 
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image data is commonly achieved by using a DCT followed by a 
quantization of the DCT coefficients obtained by the DCT. 

In a DCT of one -dimensional digital data, a respective 
sequence of source values of a predetermined number is 
transformed into transform coefficients. In video coding, 
the source values can be for instance pixel values or 
prediction error values. Each of the resulting transform 
coefficients represents a certain frequency range present in 
the source data. DCT of values /() into coefficients F() is 
defined as : 

^ = ^C(0|/(,)cos[fe^] ,i=0,l N-1 

In this equation, 1ST is the predetermined number of source 
values in one sequence of source values . 

For compression, image data is usually provided in blocks of 
two-dimensional digital data. For such data DCT is defined 
as : 

Hu)=j F cmtp(*,y)c°{l^) 0 {&^ ,i-j-o.i, 

W L 1,4*0 
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DCT is a separable operation. That means that a two- 
dimensional DCT can be calculated with two consecutive one- 
dimensional DCT operations. Using the one -dimensional DCT 
operation is preferred because the complexity of a one- 
dimensional DCT is relative to N, while the complexity of a 
two-dimensional DCT is relative to N J . For image data having 
a size of N*N, -the total complexity of all the DCT 
operations is relative to N 3 or N 2 log (N) for fast DCT. Thus 
large transforms, which also involve many non- trivial 
multiplications, are computationally very complex. 
Furthermore, the additionally required accuracy in bits may 
increase the word width. For complexity reasons DCT is 
commonly performed only for small block of values at a time, 
for example 4x4 or 8x8 values, which can be represented in 
form of a matrix with values f () . Figure 1 illustrates a DCT 
of such a 4x4 matrix 1. 

First, each row of the matrix 1 is transformed separately to 
form a once transformed matrix 2. In the depicted matrix 1, 
a separate transformation of each row is indicated by 2- 
headed arrows embracing all values of the respective row. 
Then, each column of the once transformed matrix 2 is 
transformed separately to form the final transformed matrix 
3 comprising the transform coefficients F() . In the depicted 
matrix 2, a separate transformation of each column is 
indicated by 2 -headed arrows embracing all values of the 
respective column. 

The DCT defined by the above equation can also be written in 
matrix form. To this end, F(i) is first written in a more 



suitable form 



^•)=E/W4^) ,1=0,1, ...,n-i 

Matrix A is a matrix of DCT basis functions. A two 
dimensional DCT can then be calculated with: 

Y = AXA T , 

where matrix X denotes a source value matrix, and where 
matrix Y denotes the transform coefficients resulting in the 
DCT. The index T of a matrix indicates that the transpose of 
the matrix is meant. 

After DCT, the actual compression is achieved by 
quantization of DCT coefficients . Quantization is achieved 
by dividing the transform coefficients with quantization 
values that depend on a quantization parameter qp: 

r{tj)=r(t,j)/Q(apXu), 

where Q(qp) is a quantization matrix, and where Y 1 (i,j) 
constitute quantized coefficients. The simplest form of 
quantization is uniform quantization where the quantization 
matrix is populated with one constant, for example: 
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The quantized coefficients constitute compressed digital 
data which has for example, after the encoding and possible 
further processing steps, a convenient form for transmission 
of said data. 

When the compressed data is to be presented again after 
storing and/or transmission, it has first to be decompressed 
again. 

Decompression is performed by reversing the operations done 
during compression. Thus, the quantized coefficients Y'{i,j) 
are inverse quantized in a first step by multiplying the 
quantized coefficients with values of quantization matrix: 

Next, the dequantized but still transformed coefficients 
Y(i,j) are inverse transformed in a second step by an 
inverse discrete cosine transform (IDCT) : 

X = A T YA , 

where matrix Y denotes as in the DCT the transformed 
coefficients, and where matrix X denotes the regained source 
value matrix , 
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If infinite precision is used for all calculations, X will 
contain exactly the original pixel values. In practice, 
however, the coefficients are converted to integer values at 
least after quantization and inverse transform. As a result, 
the original pixels can not be exactly reconstructed. The 
more compression is achieved, the more deviation there is 
from the original pixels. 

If the above described DCT and IDCT are implemented 
straight -forward, each conversion requires several 
multiplications, additions and/ or subtractions. These 
operations, however, require on the one hand a significant 
amount of processor time, and on the other hand, 
multiplications are quite expensive operations with respect 
to circuit area in some architectures. In order to be able 
to transmit for example high quality motion displays, it is 
thus desirable to dispose of a conversion process which 
requires fewer multiplication steps without reducing the 
quality of the data regained in decompression. 

Since the DCT is also a central operation in many image 
coding standards, it has been widely used, and a variety of 
solutions for the stated problem has been described in 
literature. These solutions generally feature the "butterfly 
operation" and/or combine some calculations from the 
operator matrix to the quantization step at the end of the 
DCT process. 

The us Patent 5,523,847 describes for example a digital 
image processor for color image compression, in order to 



reduce the number of non-trivial multiplications in DCT, it 
is proposed in this document to factor the transform matrix 
in a way that decreases the number of non- trivial 
multiplications, non-trivial multiplications being 
multiplications or divisions by a factor other than a power 
of two. The trivial multiplications can be realized by bit- 
shifting, hence the name 'trivial'. More specifically, the 
transform matrix is factored into a diagonal factor and a 
scaled factor such that the diagonal factor can be absorbed 
into a later quantization step that and the scaled factor 
can be multiplied by a data vector with a minimum of non- 
trivial multiplications. In addition, it is proposed that 
remaining non-trivial multiplications are approximated by 
multiplications by rational numbers, since the computation 
can then be achieved only with additions, subtractions and 
shift operations. This leads to a problem in IDCT, however, 
since with the approximation, there may not exist an exact 
inverse transform any more for the transform. Therefore, 
repeating the DCT-IDCT process may result in severe 
deterioration in image quality. This may happen, e.g., when 
the image is transmitted several times over a communications 
link where DCT compression is utilized, 

Another approach is described in a document by Gisle 
Bjontegaard: "H.2 6L Test Model Long Term Number 7 (TML-7) 
draftO", ITU Video Coding Experts Group, 13th Meeting, 
Austin, Texas, USA 2-4 April, 2001. This document describes 
a DCT solution constituting the current test model for a 
compression method for ITU-T recommendation H.26L. 
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According to this document, instead of DCT, an integer 
transform can be used which has basically the same coding 
property as a 4x4 DCT . In the integer transform, four 
transform coefficients are obtained from four source data 
pixels respectively by four linear equations summing the 
pixels with predetermined weights. The transform is followed 
or preceded by a quantization/ dequantization process, which 
performs a normal quantization/ dequantization. Moreover, a 
normalization is required, since it is more efficient to 
transmit data having a normalized distribution than to 
transmit random data . Since the transform does not contain a 
normalization, the quantization/ dequantization carries out 
in addition the normalization which is completed by a final 
shift after inverse transform. The quantization/ 
dequantization uses 32 different quality parameter (QP) 
values, which are arranged in a way that there is an 
increase in the step size of about 12% from one QP to the 
next. Disadvantage of this approach is that it requires a 
32-bit arithmetic and a rather high number of operations. 

Another document proceeds from the cited TML-7 document: "A 
16 -bit architecture for H.26L, treating DCT transforms and 
quantization", Document VCEG-M16 , Video Coding Experts Group 
(VCEG) 13th meeting, Austin, USA, 2-4 April, 20 01, by Jie 
Liang, Trac Tran, and Pankaj Topiwala. This VCEG-M16 
document mainly addresses a 4x4 transform, and proposes a 
fast approximation of the 4 -point DCT, named the binDCT, for 
the H.26L standard. This binDCT can be implemented with only 
addition and right-shift operations. The proposed solution 
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can be implemented to be fully invert ible for lossless 
coding . 

The proposed binDCT is based on the known Chen-Wang plane 
rotation-based factorization of the DCT matrix. For a 16-bit 
implementation of the binDCT, a lifting scheme is used to 
obtain a fast approximation of the DCT. Each employed 
lifting step is a biorthogonal transform, and its inverse 
also has a simple lifting structure. This means that to 
invert a lifting step, it is subtracted out what was added 
in at the forward transform. Hence, the original signal can 
still be perfectly reconstructed even if the floating-point 
multiplication results are rounded to integers in the 
lifting steps, as long as the same procedure is applied to 
both the forward and the inverse transform. 

To obtain a fast implementation, the floating-point lifting 
coefficients are further approximated by rational numbers in 
the format of k/2m, where k and m are integers, which can be 
implemented by only shift and addition operations. To 
further reduce the complexity of the lifting-based fast DCT, 
a scaled lifting structure is used to represent the plane 
rotation. The scaling factors can be absorbed in the 
quantization stage. 

The solution proposed in the VCEG-M16 document only requires 
16-bit operations assuming that the source values are 9 -bit 
values and less operations than the solution of the TML-7 
document. More specifically, it requires for a 1-D DCT of 
four data values 10 additions and 5 shifts. 
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Further documents relating to image data compression are for 
instance the following, the contents of which are only 
addressed briefly: 

US Patent 6,189,021, granted February 13, 2001, proposes to 
employ a set of scaled weighting coefficients in the 
intrinsic multiplication stage of a six-stage DCT fast 
algorithm for one of two one -dimensional DCT operations so 
that a corresponding stage of the DCT fast algorithm for the 
other one of the one -dimensional DCT operations can be 
omitted. 

US Patent 5,129,015, granted July 7, 1992, makes use of a 
method similar to DCT but employing a simpler arithmetic for 
compressing still images without multiplication. 

US Patent 5,572,236, granted November 5, 1996, relates to a 
digital image processor for color image compression which 
minimizes the number of non-trivial multiplications in the 
DCT process by rearranging the DCT process such that non- 
trivial multiplications are combined in a single process 
step. 

PCT application WO 01/31906, published May 3, 2001, relates 
to a transform-based image compression framework called 
local zerotree coding. 

SUMMARY OF THE INVENTION 
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lt is an object of the invention to reduce the operations 
required for conventional DCT and IDCT while avoiding the 
necessity of non-trivial multiplications. It is also an 
object of the invention to provide an alternative to known 
methods which requires less operations than the regular DCT 
or IDCT. It is further an object of the invention to 
guarantee a high quality of the digital data after 
decompression. 

For compression of digital data, the objects of the 
invention are reached by a method comprising in a first step 
simplifying a predetermined transform matrix to require less 
operations when applied to digital data. In a second step, 
elements of the simplified transform matrix constituting 
irrational numbers are approximated by rational numbers . In 
order to compensate for these measures, a predetermined 
quantization is extended to include the operations which 
were removed in the simplification of the predetermined 
transform matrix. The included operations are further 
adjusted to compensate for the approximation of elements of 
the simplified transform matrix by rational numbers. 
Finally, the simplified transform matrix with the 
approximated elements and the extended quantization are 
employed as basis for implementing a sequence of a transform 
and a quantization which are to be applied to digital data 
that is to be compressed. 

For decompression of compressed digital data, the objects 
are reached with a method in which a predetermined inverse 
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transform matrix is processed according to the predetermined 
transform matrix in the proposed method for compression. In 
addition, a predetermined dequantization is extended 
according to the extension of the predetermined quantization 
in the proposed method for compression, The resulting 
extended dequantization and the resulting simplified inverse 
transform matrix with approximated elements can then be used 
as basis for implementing a sequence of a dequantization and 
an inverse transform which are to be applied to compressed 
digital data for achieving a decompression. 

The objects of the invention are finally reached with an 
encoder comprising a corresponding transformer approximating 
a DCT and a corresponding quantization means, and with a 
decoder comprising a corresponding dequantization means and 
a corresponding transformer approximating an IDCT. 

The invention proceeds from the idea that the number of 
operations required for a DCT can be reduced significantly, 
if on the one hand, operations are extracted from the 
transform matrix and absorbed in the quantization, and if on 
the other hand, the remaining entries not constituting a 
rational number are approximated by such a rational number. 
In order to ensure that decompression can be carried out 
correctly, however, it is proposed that in addition the 
approximation is compensated in the operations moved to the 
quantization. 

It is an advantage of the invention that it enables a fast 
computation of the transform, since it enables a reduction 
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of the number of required operations, e.g, compared to TML-7 
and VCEG-M16, thus saving processor time. Still, the 
invention achieves a quality very close to the solution of 
the VCEG-M16 document. Since the invention moreover enables 
a consideration of the inversion properties of the transform 
operation by compensating the carried out approximations, a 
degradation of the quality of the processed data, especially 
at low bit races, can be avoided. It is thus a further 
advantage of the invention that a better inversion accuracy 
is achieved compared e.g. to US 5,523,847. 

Preferred embodiments of the invention become apparent from 
the subclaims . 

The proposed step of simplifying the predetermined transform 
matrix to require less operations when applied to digital 
data preferably comprises factoring the predetermined 
transform matrix into two factors, one factor constituting 
a diagonal matrix and the other factor constituting a 
simplified transform matrix. The diagonal matrix then 
comprises the operations removed from the simplified 
transform matrix. 

Advantageously, the rational numbers by which the remaining 
entries of the simplified transform matrix are approximated 
are given by fractions with a denominator of 2 n , wherein n 
is an integer. Such rational numbers are particularly suited 
for binary arithmetic, since a multiplication or division by 
2 n can be realized by bit-shift operations. Thus all 
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multiplications can be avoided in DCT with the proposed 
app r oxima t i on . 

For a given transform matrix, e.g. a 4x4 DCT matrix, and for 
a selected rational number for approximating remaining 
entries in the simplified matrix, a single set of equations 
including only additions, subtractions and shifts can then 
be determined for each, one -dimensional transform that has to 
be performed for transforming one- or two-dimensional 
digital data values , 

The adjustment to the approximation in the quantization step 
preferably ensures that the applied transform has an inverse 
transform. This can be achieved by ensuring that A T A=I, 
where matrix A is in this case a matrix for which all 
extracted operations were re-included in the simplified 
transform matrix after approximation of the remaining 
entries and after compensation of the approximation in the 
extracted operations , This guarantees that an approximation 
of an IDCT can be performed with good quality. 

For a given transform matrix, e.g. a 4x4 DCT matrix, and for 
a selected rational number for approximating remaining 
entries in the simplified matrix, also the required 
adjustment of specific values in the extracted operations 
can be calculated in a general way. 

However, in larger transforms like the 8x8 transform, it may 
additionally be necessary to adjust the approximations of 
the simplified matrix, since approximating the DCT transform 
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coefficients in an optimized implementation leads to a 
forward- inverse transform pair that is incomplete, leading 
to a "spill 11 of pixel values to non-adjacent, i.e. 2 pixels 
away, pixels. This results in blurring of the image. By a 
correct choice of the approximations, the off -diagonal 
elements of the matrix resulting from forward and inverse 
transform of a unity matrix are nullified. In very large 
matrices, it may not be possible to adjust the 
approximations sufficiently. In such cases, the 
approximations may be adjusted to produce an optimal result 
in the least-squares sense. Furthermore, optimization is 
needed to limit the solutions to a Dirichlet set, i.e., to 
certain rational numbers. 

For the proposed quantization step, preferably a 
quantization matrix is determined by multiplying a 
predetermined sequence of quantization coefficients with a 
matrix extracted from the transform matrix for simplifying 
the transform matrix. The extracted matrix comprises the 
operations extracted from the predetermined transform matrix 
and is adjusted to compensate for the approximation of 
remaining entries in the simplified transform matrix by 
rational numbers . 

The proposed method can be employed equally for one- 
dimensional as for a two-dimensional transforms. For a two- 
dimensional transform to be applied to two-dimensional 
digital data, simplification and approximation is carried 
out for the predetermined transform matrix and for the 
transpose of the simplified transform matrix. These two 
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matrices are then employed as basis for implementing the 
transform. The extended quantization includes in this case 
operations removed from both of said matrices, which 
operations are adjusted to compensate for the approximations 
in both of said matrices. 

The presented preferred embodiments can be employed not only 
for the method according to the invention for implementing 
transform and quantization, but in a corresponding manner 
also for the method for implementing inverse transform and 
dequantization, for the encoder and for the decoder 
according to the invention. 

In practice, the transform matrices calculated for transform 
can usually be used in transposed form for inverse transform 
without performing any separate calculations, if the 
operator matrix for the transform is unitary, like a DCT 
transform matrix. In this case, equally the operations to be 
included in the extended quantization can be used at the 
same time for the extension of a predetermined 
dequantization. 

The invention can be employed for the compression of any 
kind of digital data for any purpose, e.g. in a mobile 
communications system like GSM (Global System for Mobile 
communication) or UMTS (Universal Mobile Telecommunication 
System) , 

The invention can be implemented for example as a part of an 
MPEG codec . 
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Other objects and features of the present invention will 
become apparent from the following detailed description 
considered in conjunction with the accompanying drawings. 
It is to be understood, however, that the drawings are 
designed solely for purposes of illustration and not as a 
definition of the limits of the invention, for which 
reference should be made to the appended claims instead. 

BRIEF DESCRIPTION OF THE FIGURES 

In the following, the invention is explained in more detail 
with reference to drawings, of which 

Fig. 1 illustrates a DCT applied to a matrix of 4x4 
values; and 

Fig. 2 schematically shows a block diagram of an encoder 
and of a decoder employed for compressing and 
decompressing digital data according to an 
embodiment of the invention. 

DETAILED DESCRIPTION" OF THE INVENTION 

Figure 1 has already been described above. 

The block diagram of figure 2 includes components of an 
exemplary system in which the invention can be realized. On 
the left-hand side of figure 2, an encoder 4 is depicted. 
The encoder 4 is part of a first unit, e.g. a piece of user 
equipment of a mobile communications system, capable of 
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providing and transmitting video data. Connected between its 
input and its output, the encoder 4 comprises a DCT 
transformer 41, a quantization means 42 and additional means 
43. On the right-hand side of figure 2, a decoder 5 is 
depicted. The decoder 5 is part of a second unit, e.g. 
equally a piece of user equipment of a mobile communications 
system, capable of receiving and displaying video data. 
Connected between its input and its output, the decoder 5 
comprises a means 53, a dequantization means 53 and an IDCT 
transformer 51. 

In case video data is to be transmitted from the first unit 
to the second unit, e.g. via a communications network, the 
video data is provided to the encoder 4 of the first unit as 
digital data. In the encoder 4, the digital data is first 
transformed by the DCT transformer 41, and then quantized by 
the quantization means 42. After quantization, the data is 
further processed by the additional means 43 . This 
processing includes encoding of the quantized data for 
transmission, possibly preceded by a further compression, 
and is not dealt with in this document, since it is not 
relevant to the invention. 

The processed data is then transmitted from the first unit 
comprising the encoder 4 to the second unit comprising the 
decoder 5 . The second unit receives the data and forwards it 
to the decoder 5. In the decoder 5, in a first step some 
processing is carried out in the means 53, which processing 
corresponds in an inverse way to the processing in block 43 
of the encoder 4. Thus, the processing, which is not dealt 
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with in this document, may include decoding, followed 
possibly by a first step of decompression. The processed 
data is then deguantized by the dequantization means 52 and 
moreover subjected to an IDCT by the IDCT transformer 51. 
The regained video signals provided by the IDCT 51 are 
output by the decoder 5 for display by the second unit to a 
user. 

An embodiment of an implementation according to the 
invention of the DCT transformer 41 and of the quantization 
means 42 of the encoder 4 of figure 2 will now be derived. 
For this implementation, which proceeds from the DCT and the 
quantization as described in the background of the 
invention, it is assumed that compression of input digital 
image data is to be carried out for blocks of digital data 
comprising 4x4 values . An embodiment of an implementation 
according to the invention of a corresponding decompression 
by the dequantization means 52 and the IDCT transformer 51 
of the decoder 5 of figure 2 will be indicated as well. 

It is to be noted that in the presented equations, the same 
denomination may be employed in different equations for 
different matrices. The kind of the respective matrices will 
be indicated for each equation at least if a corresponding 
denomination was used before for another kind of matrix. 

In accordance with the above mentioned equation for DCT, Y = 
AX& 1 , the 4x4 forward DCT transform can be calculated as 
follows : 



-a -a 
-b b 



-c -a 
-b a 



where Y is the desired transformed matrix, where X is a 
matrix containing the 4x4 source values x £j (i,j = 1-4), and 
where A is the 4x4 DCT transform matrix. The values a, b and 
c of matrix A can be obtained easily from the above 
definition for A(i,x) : 



a = 1/2 

i = VV2-cos(^/8) 
c = VV2 -cos(3;r/S) 



In the equation for the forward DCT, matrix A can be 
factorized, resulting in a diagonal matrix B and a 
simplified transform matrix C. A corresponding factorization 
can be carried out for the transposed form of A, A T . If d is 
denoted in addition as d=c/b, the forward DCT can be written 
as 



Y=BCXC r B = 

'a o o ol ri i i i "I r Xn x L1 

0 b 0 0 ^ 1 d -d -I x 2l x 27 

0 0 a 0 * 1 -1 -1 1 • Xn 

[0 0 0 bj [d -1 1 -dj J*,, x 42 



x u x n~\ H 1 1 d 1 [a 0 0 0" 

x zi x u 3 ^ —1—1 0 b 0 0 

^ ^ I -d -I 1 * 0 0 a 0 

*« x *} [} - 1 1 -dj [o 0 0 b 



Since B is a diagonal matrix, the above equation can be 
written as 
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i M 1 1 

b b b b 



a a a a 
b b b b 



1 d 
1 -1 
d -1 



1 I 
I d 
1 -4 
1 -1 



d Ti 

-l 

i 



bah' 
b a b 
bob 
b a b_ 



where ® is used to indicate that the respective two 
matrices are multiplied entry-wise instead of a full matrix 
multiplication, 

After combining D and its transposed form D T into E, the 
final DCT is : 



Y = (CXC T ) ®£ = 
"l 1 1 1 
1 d —d -J 
1-1-1 1 
d -1 1 -c 



Y C ®E 



1 -d 
1 -1 



-1 1 
1 -d 



a 1 ab a 1 ab 

ab b 2 ab b 1 

a 2 ab a 1 ab 

ab b 1 ab b' 



In a next step, the coefficient d is converted into a fixed- 
point format which can be represented by a rational fraction 
with a denominator of 2 n . The value of d when considering 
eight decimal places is 0.41421356. Two of the possible 
fixed-point approximations for d are 3/8=0.375 and 
7/16=0.4375, both of which can be implemented with the same 
number of add and shift operations . More accurate 
approximations such as 13/32 , 27/64 and 53/128 require more 
additions and shifts but do not improve the achieved 
compression significantly in practice. Thus, 7/16, which is 
closer to d than 3/8, is selected as fixed-point format for 
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d. 



After converting coefficient d to a fixed-point 
representation, b has to be adjusted in a way that the 
transform has an inverse transform. The condition for an 
inverse transform to exist is given by: 



When solving the above equation for a matrix A which was 
reassembled from factors B and C after the approximation of 
d, the condition for an adjusted b is found to be: 



Matrix E is thus adjusted by substituting this new value for 
the old value of coefficient b. 

Now, a simplified DCT can be implemented in the DCT 
transformer 41 of encoder 4 from which matrix E was 
extracted. That is, the implementation is based on the 
equation Y c = CXC T , wherein matrix C comprises approximated 
coefficients d, resulting in modified DCT coefficients Y c , 
Matrix E will combined with the subsequent quantization 
step, as will be explained below. 

The actual implementation of the simplified DCT may depend 
on a specific transformer architecture, in which it may be 
most important to have few total operations or to have no 



A r A = I . 
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multiplications at all. 

Two different sets of equations that can be employed in an 
implementation of the simplified DCT transformer will now be 
proposed. The equations are suited to perform a 4 -point one- 
dimensional simplified DCT which is based on simplified DCT 
transform matrix C as derived above. In matrix C, 
coefficient d is chosen to be d=7/l£ , In both sets of 
equations, X[i], i=0-3 constitutes a sequence of 4 values 
that are to be transformed and Y[i], i=0-3 a sequence of 4 
transformed values, while e and f are auxiliary variables. 

The first proposed set of equations is: 

e = X[0] + X[3] 
f = X[l] + X(2] 
Y[0] = e + f 
Y[2] = e - f 
e = X[0] - X[3] 
f = X[l] - X [2] 
Y[l] = e + 7*f/lS 
Y[3] = 7*e/l6 - f 

The two divisions in this set of equations are actually bit 
shifts . This set of equation thus requires 8 additions 2 
multiplications and 2 shifts for a total of 12 operations. 

The second proposed set of equations is: 
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e = X[0] + X[3] 

f = X[1J + X£2J 

YtO) s e + f 

YE2] = e - f 

e = X[0] - x[3] 

f = X[l] - X[2] 

Y[l] = e + (f - f/8)/2 

Y[3] = (e - e/3)/2 - f 

The second set of equations uses only additions and shifts, 
and produces identical results to the first set. Again, 
divisions are actually bit shifts. This version requires 10 
additions and 4 shifts for total of 14 operations. The 
number of operations is larger than in the first version, 
but the resulting complexity is still lower if 
multiplication is an expensive operation. Moreover, the 
results of a multiplication require a larger dynamic range. 

Either set of equations can be used for transforming two- 
dimensional data by applying it to the respective set of 
values that is to be transformed. 

The simplified transform is followed by an adapted 
quantization step. The implementation of the quantization 
depends on the used DCT. In the above mentioned TML-7 
document, a uniform quantization is used. In fast DCT, a 
non-uniform quantization matrix must be used, since some of 
the DCT multiplications are combined with quantization 
multiplications. The above mentioned binDCT of document 
VCEG-MLS moreover uses divisions for quantization and 
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requires only 16-bit operations. A division, however, is 
generally a rather slow operation. Therefore, in the 
presented embodiment a uniform quantization using only 
multiplications is implemented in quantization means 42. 

As already mentioned above, quantization can be performed 
using division so that 

Y%j)=Y(ij)!0{qpli,j). 

Since division is a costly operation, multiplication can be 
used instead. For this purpose quantization matrix R is 
calculated as 

R{qp%j) = \A[Q{qplij) t 

after which quantization can be performed with 
multiplication: 

Yiij)=y(u)R(qp)(ij). 

In the quantization proposed in document TML-7, the 
quantization coefficients are approximately as shown below. 
Other coefficients could be used as well. 

a (qp) = 

2.5000, 2.8061, 3,1498, 3.5354, 3.9684, 4.4543, 4.9996, 
5.6120, 6.2992, 7.0706, 7.9364, 8.9082, 9.9990, 11.2234, 
12.597C, 14.1404, 15.8720, 17.8155, 19.9971, 22.4458, 
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25.1944, 28.2795, 31.7424, 35.6293, 39.9922, 44.8894, 
50.3863, 56.5562, 63.4817, 71.2552, 79.9806, 89.7745. 
In order to be able to absorb the matrix E extracted from 
the DCT in the quantization, a quantization matrix R for 
quantization parameter qp is calculated as 

The final quantized coefficients Y 1 (i,j) could now be 
determined from the transform coefficients Y c (i,j) resulting 
in the simplified DCT by 

r'(i,J) = r c VJ)-*(qp)(i,j)±f , 

where / is 1/3 for intra blocks and 1/6 for inter blocks and 
has the same sign as Y c (i,j) in accordance with the TML-7 
documentation. An intra block is a macroblock which is 
encoded based only on values within the current image, while 
an inter block is a macroblock which is encoded based in 
addition on values within other images. Each macroblock is 
composed of several subblocks, e.g. the blocks of 4x4 values 
of the presented example, which are DCT transformed and 
quantized separately. 

First, however, the quantization is changed to use only 
fixed-point values. To this end, the values of R and / are 
converced prior to quantization to fixed point values by 
multiplying them by 2 r * and by rounding the results to 
integer values . n is the number of fractional bits used for 
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the fixed-point values. By choosing 11=17, the coefficients 
of R will fit in 16-bit, and thus only 16-bit 
multiplications are required in quantization. More 
specifically, 16-bit multiplications are required that 
produce 32 -bit results. 

The fixed-point quantization is then implemented in 
quantization means 42 of encoder 4 based on the equation: 

r*V, J) = <y c (f . J) ■ R(w)(U) ±f)/r, 

where R and f comprise only fixed-point values. The values 
of Y 1 (i r j ) are output by the quantization means 42 as the 
desired compressed digital image data. 

For decompression of the compressed digital image data in 
the decoder 5 of figure 2, the dequantization means 52 and 
the IDCT transformer 53 are implemented in a corresponding 
way as the quantization means 42 and the DCT transformer 41 
of encoder 4. The basic IDCT is calculated from the basic 
DCT as : 

X = A T YA , 

where matrix X contains the desired, regained source values, 
where matrix A is the original DCT transform matrix, and 
where matrix Y contains dequantized values obtained by a 
decompression as described in the background of the 
invention. 
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Proceeding from this equation, the inverse transform can be 
formulated correspondingly to the forward transform using an 
extracted matrix E: 

x = c T (y®£)c . 

where matrices C and C T correspond to the matrices C and C T 
used for the reduced DCT in block 41 of the encoder 4. 

Matrix E of this equation can be absorbed in the 
dequatization step preceding the IDCT. 

This can be realized similarly to absorbing matrix E in 
quantisation. Dequantization coefficients are inverse values 
of the quantization coefficients. 

A dequantization matrix Q for a quantization parameter qp 
including matrix E can be calculated as: 

The compressed coefficients Y' (i,j) can therefore be 
dequantized to dequantized coefficients X' according to 

the equation: 

In this equation, X'(i,j) corresponds to the term Y®E in 
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the above equation for inverse transform X=C T (Y®E)C. 

When fixed-point numbers are used, values of Q are converted 
to fixed-point values prior to dequantization by multiplying 
them by 2 n and rounding results to integer values. By 
choosing n=5 for dequantization, all calculations in 
dequantization can be done using only 16-bit operations. 
Fixed-point dequantization is then implemented in 
dequantization means 52 of decoder 5 based on the equation: 

X\i,f) = Yii,jyQ{qp)(i,j) t 

in which Q contains only fixed-point numbers. After 
dequantization the values of X'(i,j) should be normalized by 
2 n , but normalization is postponed to be done after final 
IDCT in order to achieve a better accuracy. 

The simplified inverse transform can then be implemented in 
the IDCT transformer 51 of decoder 5 according to the 
equation : 

X = C T X'C 

after which fixed-point values of X are converted to integer 
values based on the equation: 

^a/) = (A'a J ;-) + 2 a - l )/2 a , 

where the division by 2 tt is realized with a simple 
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arithmetic bit-shift. 

The actual implementation of the IDCT transformer 51 can 
comprise a set of equations including only additions, 
subtractions and shifts corresponding to those presented for 
the DCT transformer 41. 

According to another embodiment of the invention, an 6x8 DCT 
and IDCT can be implemented as presented in the following. 
In accordance with the equation for DCT, Y = AXA T , the 8x8 
forward DCT transform can be calculated as follows: 
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• X*A T , 



where Y is the desired transformed matrix, where X is a 
matrix containing the 8x8 source values x y (i,j = 1-8), as 
above in the 4x4 embodiment, where A is the 8x8 DCT 
transform matrix, written open on the right-hand side of the 
equation, and where A T is the transpose of A. The values a, 
b, c, d, e, f and g of matrix A can be obtained easily from 
the above definition for A(i,x) : 
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a = 1/(2 VI) 
£ = l/2-cos(/r/16) 
c = l/2-cos(3^-/16) 
J = l/2-cos(5*716) 
e = l/2-cos(7^-/16) 
/ = l/2-cos(W/8) 
g = l/2-cos(3^/8) 



In the equation for the forward DCT, matrix A can be 
factorized, resulting in a diagonal matrix B and a 
simplified transform matrix C. A corresponding factorization 
can be carried out for the transposed form of A, A 1 . If we 
use the notation Xy = x/y, the forward DCT can be written as 



Y = 


BCXC 7 B 
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Since B is a diagonal matrix, the above equation can, as in 
the 4x4 case, be written as 
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Y = D®(CXC r )®D r = 
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where ® is used to indicate that the respective two 
matrices are multiplied entry-wise instead of a full matrix 
multiplication. 

After combining D and its transposed form D T into E, the 
final DCT is : 
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In a next step, the coefficients c^, d b , and g f are 
converted into a fixed-point format which can be represented 
by a rational fraction with a denominator of 2 n . Close 
approximations are c b =7/8, d fc =9/16, e 6 =3/16 and g f =7/l6. 
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After converting coefficients c b , d b , e b and g £ to a fixed- 
point representation, b and f have to be adjusted in a way 
that the transform has an inverse transform. The condition 
for an inverse transform to exist is again given by: 

A T A~I . 

When solving the above equation for a matrix A which was 
reassembled from factors B and C after the approximation of 
c b , c^, e b and g f/ the condition for adjusted b and f are 
found to be: 

i i 

Matrix E is thus adjusted by substituting these new value 
for the old value of coefficients b and f . 

However, to fulfill the condition A T A=I, the values for c b , 
d b and e t have to be chosen differently, since otherwise the 
product A T A features non-zero off -diagonal elements. The 
necessary condition for the 8x8 case is 

d t-c t +e b {d b +c s ) = 0, 

which can be fulfilled by choosing c b =15/16, d b =9/16 and 
e b =l/4. 
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Now, a simplified DCT can again be implemented in the DCT 
transformer 41 of encoder 4 from which matrix E was 
extracted. That is, the implementation is based on the 
equation Y c = CXC T , wherein matrix C comprises approximated 
coefficients c b , d b , e^, and g f , resulting in modified DCT 
coefficients Y c . Matrix E will be combined with the 
subsequent quantization step similarly as explained above 
for the 4x4 DCT. 

According to a third embodiment of the invention, the 
approximations are adjusted for the condition A T A=I by- 
optimizing the selection of fractional numbers in the 
transform. The fractional numbers are selected so that the 
off -diagonal elements of the matrix A T A are as close to zero 
as possible in the implementation sense. The solution of the 
optimization is limited to a Dirichlet set, i.e., to certain 
rational numbers . 

According to a fourth embodiment of the invention 
approximations of a fast-DCT algorithm are adjusted for the 
condition A T A=I by optimizing the selection of fractional 
numbers in the transform. 

In the whole, it becomes apparent from the described 
embodiments of the invention that efficient alternative 
implementations for compressing digital data are presented. 
The implementation can be realized to be either more 
accurate than known implementations or faster or both. 
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Thus, while there have been described and pointed out 
fundamental novel features of the invention as applied to a 
preferred embodiment thereof, it will be understood that 
various omissions and substitutions and changes in the form 
and details of the devices and methods described may be made 
by those skilled in the art without departing from the 
spirit of the invention. For example, it is expressly 
intended that all combinations of those elements and/or 
method steps which perform substantially the same function 
in substantially the same way to achieve the same results 
are within the scope of the invention. Moreover, it should 
be recognized that structures and/or elements and/ or method 
steps shown and/or described in connection with any 
disclosed form or embodiment of the invention may be 
incorporated in any other disclosed or described or 
suggested form or embodiment as a general matter of design 
choice. It is the intention, therefore, to be limited only 
as indicated by the scope of the claims appended hereto. 



